• Àüü
  • ÀüÀÚ/Àü±â
  • Åë½Å
  • ÄÄÇ»ÅÍ
´Ý±â

»çÀÌÆ®¸Ê

Loading..

Please wait....

Çмú´ëȸ ÇÁ·Î½Ãµù

Ȩ Ȩ > ¿¬±¸¹®Çå > Çмú´ëȸ ÇÁ·Î½Ãµù > Çѱ¹Á¤º¸°úÇÐȸ Çмú´ëȸ > 2015³â µ¿°èÇмú¹ßǥȸ

2015³â µ¿°èÇмú¹ßǥȸ

Current Result Document :

ÇѱÛÁ¦¸ñ(Korean Title) ºÐ»ê ȯ°æ¿¡¼­ Count-min Sketch¸¦ ÀÌ¿ëÇÑ Top-K ºóµµ °Ë»ö
¿µ¹®Á¦¸ñ(English Title) Finding Top-K Frequent Route with Distributed Count-Min Sketch
ÀúÀÚ(Author) Æĵô¶ó Äí¸£È÷³ª ǪƮ¸®   ¾È¼º¾Æ   ±ÇÁØÈ£   Fadhilah Kurnia Putri   Seonga An   Joono Kwon  
¿ø¹®¼ö·Ïó(Citation) VOL 42 NO. 02 PP. 0052 ~ 0054 (2015. 12)
Çѱ۳»¿ë
(Korean Abstract)
¿µ¹®³»¿ë
(English Abstract)
Due to the huge volume of data size, the large-scale data anlysis is interesting issue nowadays. among those, analyzing traffic data is one of meaningful contribution since traffic congestion is serious problem. Based on this concern, by analyzing the frequency of road trip events, after this we can identify which route within a city has most frequenct trip. However a scalability problem would occur when we try to convert the traffic data into the analyzed frequency information due to the large size of traffic data. In this paper, we present Top-K query processing system based on a distributed Count-Min Sketch algorithm for taxi tip events. First, we analyze raw taxi trip events with Count-Min Sketch using sub-linear space, which demands less memory space than the nuuber of distinct elements. The query processing for frequent route is done with Spark SQL. Our approach is implemented on Apache Spark, the cluster computing framework. We validate our approach using real taxi trip events of New York City.
Å°¿öµå(Keyword)   
ÆÄÀÏ÷ºÎ PDF ´Ù¿î·Îµå